Update the cache row dim calculation in TBE SSD #4480

Closed
wants to merge 1 commit into from

@q10 (Contributor) commented Jul 12, 2025

Summary: The current cache row dim calculation in TBE SSD assumes that each optimizer's state size is fixed, independent of the table's embedding dimension. This change updates the calculation to account for optimizers whose state size depends on the row length, such as Partial Rowwise Adam.

Reviewed By: emlin, jiawenliu64

Differential Revision: D77321062
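
For readers unfamiliar with the issue, the following is a minimal sketch of the idea, not the code from this change. All names here (`optimizer_state_bytes`, `cache_row_dim`, the optimizer strings, the alignment value) are hypothetical, and the state layouts are assumptions: rowwise Adagrad is taken to keep one scalar per row, and Partial Rowwise Adam is taken to keep one full-length moment plus one scalar per row.

```python
def ceil_div(a: int, b: int) -> int:
    """Integer ceiling division."""
    return -(-a // b)


def optimizer_state_bytes(optimizer: str, dim: int, state_itemsize: int = 4) -> int:
    """Bytes of optimizer state stored alongside one embedding row.

    The layouts below are illustrative assumptions, not FBGEMM's definitions.
    """
    if optimizer == "rowwise_adagrad":
        # One scalar per row: the size does not depend on dim.
        return state_itemsize
    if optimizer == "partial_rowwise_adam":
        # One full-length moment (dim values) plus one rowwise scalar:
        # the size grows with the row length.
        return dim * state_itemsize + state_itemsize
    raise ValueError(f"unknown optimizer: {optimizer}")


def cache_row_dim(
    optimizer: str,
    dim: int,
    weight_itemsize: int = 2,  # assumed fp16 weights
    state_itemsize: int = 4,   # assumed fp32 optimizer state
    align: int = 4,            # assumed alignment, in weight elements
) -> int:
    """Row width of the cache, measured in weight-typed elements."""
    state_bytes = optimizer_state_bytes(optimizer, dim, state_itemsize)
    # Express the state in units of the weight dtype, rounding up.
    state_elems = ceil_div(state_bytes, weight_itemsize)
    # Pad the combined row to the alignment boundary.
    return ceil_div(dim + state_elems, align) * align
```

Under these assumptions, the overhead added to a row is constant for rowwise Adagrad (4 elements at both `dim=128` and `dim=256`), while for Partial Rowwise Adam it grows with `dim`, which is why a calculation that assumes a fixed overhead would undersize the cache row.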


netlify bot commented Jul 12, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit be097b6
🔍 Latest deploy log https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/687574445028420008b7d13b
😎 Deploy Preview https://deploy-preview-4480--pytorch-fbgemm-docs.netlify.app

@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D77321062

q10 added a commit to q10/FBGEMM that referenced this pull request Jul 14, 2025
Summary:

X-link: facebookresearch/FBGEMM#1537

- The current cache row dim calculation in TBE SSD assumes that each optimizer's state size is fixed, independent of the table's embedding dimension. This change updates the calculation to account for optimizers whose state size depends on the row length, such as Partial Rowwise Adam.

Reviewed By: emlin, jiawenliu64

Differential Revision: D77321062
@q10 q10 force-pushed the export-D77321062 branch 2 times, most recently from 8e9bcfb to 656d0ba Compare July 14, 2025 19:17
q10 added a commit to q10/FBGEMM that referenced this pull request Jul 14, 2025
Summary:
Pull Request resolved: pytorch#4480

@q10 q10 force-pushed the export-D77321062 branch 2 times, most recently from be04ecf to 04209be Compare July 14, 2025 20:30
@q10 q10 force-pushed the export-D77321062 branch from 04209be to ee4adb1 Compare July 14, 2025 20:31
@q10 q10 force-pushed the export-D77321062 branch from ee4adb1 to 696c39e Compare July 14, 2025 20:34
@q10 q10 force-pushed the export-D77321062 branch from 696c39e to ba9cfe9 Compare July 14, 2025 20:49
@q10 q10 force-pushed the export-D77321062 branch from ba9cfe9 to be097b6 Compare July 14, 2025 21:18
@facebook-github-bot
Contributor

This pull request has been merged in 619b6ab.
